The raw dataset contains 7,728,394 observations (rows) of 46 variables (columns).
After data preparation and cleaning, the dataset contains 7,546,771 observations (rows) of 53 variables (columns).
The author defines severity as “the impact on traffic.” Low severity accidents would have a minimal effect on traffic whereas high severity accidents would have a significant impact on traffic.
We can observe that the majority of accidents that took place between 2016 and 2023 were categorized as “less severe,” accounting for 6,010,987 of the total 7,546,771 accidents.
We can observe that, when adjusted for population, the following states: South Carolina, California, Oregon, Florida, Minnesota, had the most accidents from 2016 to 2023.
While South Carolina had the most accidents per capita, the average severity was one of the lowest of all the states. The states that had the worst average severity were Georgia, Wisconsin, Rhode Island, Kentucky, and Colorado. While some states had a higher average severity than others, the largest difference in average severity was only 0.49.